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Abstract — We consider a transmission of a delay-sensitive 
data stream from a single source to a single destination. The 
reliability of this transmission may suffer from bursty packet 
losses - the predominant type of failures in today's Internet. 
An effective and well studied solution to this problem is to 
protect the data by a Forward Error Correction (FEC) code 
and send the FEC packets over multiple paths. In this paper 
we show that the performance of such a multipath FEC scheme 
can often be further improved. Our key observation is that the 
propagation times on the available paths often significantly differ, 
typically by 10-100ms. We propose to exploit these differences by 
appropriate packet scheduling that we call 'Spread'. We evaluate 
our solution with a precise, analytical formulation and trace- 
driven simulations. Our studies show that Spread substantially 
outperforms the state-of-the-art solutions. It typically achieves 
two- to five-fold improvement (reduction) in the effective loss rate. 
Or conversely, keeping the same level of effective loss rate, Spread 
significantly decreases the observed delays and helps fighting the 
delay jitter. 

I. Introduction 

We consider a transmission of a delay-sensitive data stream 
from a single source to a single destination. How to improve 
the reliability of this transmission? Traditional ARQ (Au- 
tomatic Repeat-reQuest) mechanisms often cannot be used, 
as they impose additional and usually unacceptable delays 
of at least one RTT (Round Trip Time). A more applicable 
technique is to introduce some type of redundancy, e.g., 
Forward Error Correction (FEC). Clearly, due to the delay 
constraints, a FEC block must be of limited length [1]. This, 
in turn, makes it inefficient against bursty packet losses [1] 
- the predominant type of losses in today's Internet [2]. A 
good solution to this problem is to assign the FEC packets 
to multiple paths spanning the source and the destination [3]- 
[10]. An illustration of a multipath FEC system is presented in 
Fig. Q] Theoretically, the multiple paths could be constructed 
with the help of source routing, but this technique is not yet 
fully available in the Internet. A more practical alternative is 
the usage of overlay relay nodes that forward the traffic (as 
in Fig. [TJ. If the resulting paths are statistically independent, 
which is especially likely for multi-homed hosts, then the loss 
bursts get averaged out and FEC regains effectiveness. Similar 
performance benefits due to multipath were also observed in 
the context of Multiple Description Coding [11]. 

When designing a system that splits a FEC block across 
multiple paths, we have to (1) select some paths out of all 
candidates, (2) assign the transmission rates to these paths, 




Fig. I . Illustration of a multipath system with R = 3 paths Pi, P2, P3 
between source s and destination d. ti,t2>*3 a 1 ' 6 the corresponding path 
propagation times, k data packets are complimented with n — k redundancy 
packets, and the resulting n FEC packets are split onto the three paths using 
the rates 711,712 and 713, respectively. 

and (3) schedule the packets. The previous studies proposed 
techniques to solve (1-3) as a function of the statistical loss 
properties of the paths [4,5,10]. 

However, there are other important parameters affecting the 
performance of the multipath FEC system. In particular, in 
this paper we show that the propagation times on the available 
multiple paths often significantly differ. These differences, in 
turn, can be exploited to improve the system reliability. Below, 
we explain and motivate our approach on concrete examples 
and measurements. 

A. Propagation times on direct and indirect paths may differ 
significantly 

In Fig. [2] we study the path propagation time differences in 
the real-life Internet. The measurements were collected by run- 
ning all-to-all traceroutes between 326 nodes in DIMES [12]. 
These nodes are usually private hosts located at different sites 
around the world. (We obtained similar results for measure- 
ments on PlanetLab [13].) 

For each source-destination pair we construct a set of R 
paths. We always include the direct path Pi with propagation 
time t\. Each of the remaining R — 1 paths is indirect, i.e., 
it uses some overlay relay node to forward the traffic. We 
choose uniformly at random a number C of candidate relay 
nodes among the remaining 324 DIMES nodes. This results 
in C candidate indirect paths. From them we select the R — 1 
indirect paths following the intuitive selection procedure given 
in [5]. For R = 2 paths we choose the indirect candidate path 
that is the most IP link disjoint with the direct path Pi . Clearly, 
this minimizes the loss correlation between Pi and P^ - If there 
are more paths that achieve the minimal IP overlap, then the 
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Fig. 2. The difference between propagation times on the direct path Pi 
and the best indirect paths P2 and P3. We present the results for R = 2 
(two paths: one direct and one indirect) and R = 3 (the direct path and two 
indirect paths). The histograms (top) show the distribution of propagation time 
differences for C = 5 available candidate indirect paths. The table (bottom) 
shows the medians of these distributions for C = 1 , 5 and 50. The averages 
(not shown) are systematically higher than the medians. 
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one with the smallest propagation time is kept. For R > 2 we 
proceed similarly, except that now we consider the aggregated 
values of IP overlap and propagation time, i.e., summed over 
all R(R — l)/2 possible path pairs. 

According to Fig. |2] for R = 2, the best indirect path P2 
has propagation times larger by typically . . . 75ms than the 
direct path Pi (see tz-^ti in top-left histogram). This difference 
gets larger for a smaller number of candidates C (table at the 
bottom). 

Moreover, the path propagation time differences grow sig- 
nificantly with the number of paths R used in the system. 
As shown in Fig. HI already for R = 3 the medians of the 
distributions are roughly doubled compared to R = 2, and 
typically Pi is faster than the slower of the two indirect paths 
by max(^2, £3)— 1\ ~ . . . 150ms. 

We conclude that in the real-life Internet the propaga- 
tion time differences on multiple paths between a source- 
destination pair are significant, typically reaching several tens 
of milliseconds. 

B. The differences in propagation times can be exploited by a 
multipath FEC system 

We propose to exploit these path propagation time differ- 
ences when designing a multipath FEC system. Our solution 
is easy to implement and can bring significant performance 
gains. Let us take a concrete example described in Fig. [3] 
Assume that there exist two paths between the source and 
the destination, the direct path Pi, and an indirect path P2 
created by employing another peer that works as a relay. Let 
ti = 100ms and £2 = 150ms be the propagation delays on Pi 
and P2, respectively. So the path propagation time difference 
is At = 50ms (Fig. EK)- Let the two paths be lossy with 
the same loss rate 1% and the same average loss burst length 
of 10ms, but independent. The data packets are generated at 
the source every T — 5ms. If no form of packet protection is 
used, then the data packet loss rate observed at the destination, 
or the effective loss rate, is n* B = 1% (B). Assume now that 
we use FEC(6,4) to protect the packets. If we send all packets 



Fig. 3. Illustration of various packet schedules and their performance 
measured in the effective loss rate itg. We use two independent paths Pi 
and P2 with identical failure distributions. The data packets are generated 
at the source every T = 5ms and coded with FEC(6,4). (A) The path 
propagation time t2 on path P2 is At — ti — 1\ = 50ms larger than the 
path propagation time ti on path Pi. (B) No FEC, single path, the packets 
are sent at times 0, 5, 10, 15ms. (C) FEC on Pi only, packets are sent 
as soon as they are generated, i.e., we use the 'Immediate' schedule $ lmm . 
(D) Packets alternate between Pi and P2 with equal rates rates ni=ri2 = 3, 
as in [5,10]. The total FEC block delay resulting from this scheme serves as a 
maximal FEC block delay in the following scenarios. (E) Packets alternate 
between Pi and P2 with equal rates, but the three packets sent on Pi are 
maximally spread. (F) Packets are split between Pi and P2 with optimal 
rates ni = 4, 712 = 2, maximally spread. 

on Pi with inter-packet times T, then the effective loss rate 
after FEC decoding is tt* b = 0.553% (C). Following [5,10], 
we now split the packets equally between Pi and P2, which 
decreases tt b to 0.148% (D). This solution represents the state 
of the art. Note that now the last FEC packet on path P2 
reaches the destination tpEC = ^2+4 • T = 170 milliseconds 
after the generation of the first FEC packet at source. In other 
words, in this case the application using multipath FEC must 
accept the (maximal) delay equal to £fec- However, we can 
achieve far better results still respecting this delay constraint. 
For instance, we can appropriately increase the packet-spacing 
on Pi and achieve ir B =0.113% (E). Finally, we get even more 
significant improvement by sending 4 packets on Pi and 2 
packets P%, i.e., by applying unequal rates on the paths (F). 
This results in n* B = 0.016%, which is almost one order of 
magnitude smaller than (D). 

In other words, we exploit the differences in path prop- 
agation times by spreading the packets in time, such that 
the maximal allowed delay is respected. The gain over the 
state of the art measured in the effective loss rate ir* B may 
be very significant (here 0.016% vs 0.148%, i.e., almost ten- 
fold). Moreover, some results may seem counterintuitive. For 
instance, it may be better to use only one path than to use 
two (un-spaced) paths. It also turns out that even if the loss 
distributions on the paths are the same, the optimal rates 
assigned to these paths are not necessarily equal. 



C. Organization of this paper 

The remainder of this paper is organized as follows. In 
Section [II] we fully specify our model, which allows us to 
precisely state the problem we are solving. Next, in Section Hill 
we derive exact analytical expressions for the effective loss 
rate n* B under multipath FEC and an arbitrary schedule. In 
Section |IV] we describe the 'Immediate' schedule representing 
the state of the art, and propose a 'Spread' schedule that 
exploits the differences in path propagation times. In SectionlVl 
we evaluate our solution analytically, by simulations and by 
trace-driven simulations fed with real-life Internet traces. In 
Section [VT] we discuss the related work. Finally we conclude 
the paper and propose future directions. The details of some 
calculations are put in Appendix. 

II. Model and problem statement 

The packets, called data packets, are generated at source s, 
with constant inter-arrival time T. There exist R paths be- 
tween sender s and destination d, with the propagation delays 
tx,---,tR, respectively. 
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probability, expected value 
source node 
destination node 

(constant) interval between two consecutive data packets 
at source s 

number of independent paths between source s and desti- 
nation d 
rth path 

propagation delay on P r 

the average loss rate and loss burst length on path P r 
the number of FEC, data, and redundancy packets in a 
FEC block, respectively 

number of FEC packets assigned to P r (rate of path P r ) 
number of data packets assigned to path P r 
(constant) spacing of the n r packets on path P r 
number of lost FEC and data packets before FEC recovery 
effective loss rate, i.e., the expected fraction of lost data 
packets at the destination after the FEC recovery 
FEC block transmission time, i.e., the time between the 
generation of the first FEC packet at source s and the 
scheduled delivery of the latest FEC packet at destination d 
packet scheduling: The ith packet in a FEC block is sent 
at time T(i) over path TZ(i) 



TABLE I 

Basic notation used in this paper. 



A. Path losses 

The paths are assumed to be independent. We model bursty 
losses on each path by the popular two-state Gilbert model. 
Its basic version is a Discrete Time Markov Chain (DTMC), 
and captures the loss correlations due to queuing on bottleneck 
links, when the path is sampled at some constant rate (e.g., 
1/T). However, as we vary the sampling rates, DTMC is 
not sufficient. Indeed, on the same path we experience much 
higher loss burstiness under the packet interval of T = 5ms 
than of T = 100ms [1]. For this reason we use the continuous- 
time version of the Gilbert model [4,14] that naturally accom- 
modates different sampling rates. It is a two-state stationary 
Continuous Time Markov Chain (CTMC) {X r (t)}. The state 
X r (t) at time t assumes one of the two values: G ('good') or 
B ('bad'). If a packet is sent at time t and X r (t) = G then 
the packet is transmitted; if X r (t) = B then the packet is lost. 

(r) (r) 

We denote by n G and ir B the stationary probabilities that 

(r) 

the rth path is good or bad, respectively. Similarly, let fj, y r ' 

(r) 

and u B ' be the transition rates from G to B and from B to 
G, respectively. In this paper we use two meaningful, system- 
dependent parameters to specify the CTMC packet loss model: 

(r) 

* the average loss rate n B , and 

(r) 

« the average loss burst length (in seconds) l/fi B '. 
All other parameters can be easily derived from these two, 
because 
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B. Multipath FEC 

We use a FEC(n, k) scheme to protect the data packets 
against losses (see Fig.Q]). This means that k data packets (not 
necessarily consecutive) are encoded as one FEC block of n 
packets, called FEC packets. In particular, as in [4,6,10,15,16], 
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Fig. 4. An illustration of a schedule <S = (T, TZ) on R = 2 paths with 
FEC(6,4). Four data packets numbered 1-4 are generated at the source at equal 
intervals T; the first one specifies time t = 0. The n — k = 2 redundancy 
packets are numbered 5 and 6. According to the schedule <S = (T, 1Z), the 
ith FEC packet is sent at time T(i) > over path 7Z(i). 



we consider a systemati^ FEC, i.e., a scheme where the first 
k packets are the k data packets (unchanged). The remaining 
n — k packets, called redundancy packets, carry the redundancy 
information. The destination uses the redundancy packets to 
recover some of the lost data packets as follows. Let F be the 
number of lost FEC packets and let D be the number of lost 
data packets of a FEC block, both before the FEC recovery 
(note that D contributes to F). If F < n — k then all the n 
FEC packets and hence all the k data packets are recovered. 
In contrast, if F > n — k, then no FEC recovery is possible 
and D data packets are lost. 

C. Packet scheduling 

Finally, the packets are sent according to some schedule that 
defines when and on which path each FEC packet is sent. More 
precisely, we denote by S — (T, TV) the schedule of packets 
in a FEC block, where T and 1Z are vectors of length n. The 

'The non- systematic FEC is easier to handle, but also less efficient. We 
show its analysis in Appendix. 



ith FEC packet is sent at time T(i) over path lZ(i), as shown 
in Fig. [4] The time is counted from the generation (at the 
source) of the first data packet of the FEC block. Denote by 
ipEC the FEC block transmission time, i.e., the time between 
the generation of the first FEC packet at source s and the 
scheduled delivery of the latest FEC packet at destination d. 
Given a schedule S, ipEC can be easily computed as 



^FEC 



max 

Ki< 



(2) 



For a given schedule, ipEC can be interpreted as the total 
delay imposed by the multipath FEC system on the delay- 
sensitive application using it. Indeed, if the first packet of a 
FEC block is lost and needs to be reconstructed by FEC, then 
we have to wait up to £fec until the destination is reached by 
the other FEC packets necessary for the reconstruction of the 
lost packet. In practice, however, a constraint is likely to come 
from the delay-constrained application itself, as the maximal 
acceptable delay £fec- In this case our goal is to design a good 
schedule respecting this constraint, which is the approach used 
in this paper. 

The schedule also implicitly defines the rate n r of path P r , 
i.e., the number of FEC packets sent on P r , Similarly, let k r 
be the number of data packets among the n r packets sent on 
P r . Clearly, n r — 71 an d J2r = k. 

D. Effective loss rate tt b and problem statement 

Our ultimate goal is to send a stream of data packets over 
(possibly multiple) lossy channels in a way that minimizes the 
losses observed at the destination, given a maximal value for 
^fec- Therefore, we adopt a natural performance metric called 
effective loss rate tt b . It is defined as the expected fraction of 
lost data packets observed at the destination d after an attempt 
of FEC decoding. Now the problem can be stated as follows: 

(r) (r) 

Given the path loss properties (ir B , and t r for 

every path P r ), the FEC parameters (n and k) and maximal 
FEC block transmission time t^EC* find the schedule S that 
minimizes the effective loss rate tt b . 

We approach this problem in two steps. First, in Section Hill 
we derive an exact analytical formula for the effective loss rate 
n B for a given schedule S. Second, in Section[lV]we introduce 
a schedule that exploits the differences in path propagation 
times and outperforms the schedules proposed to date. 

III. Exact analytical derivation of the effective 

LOSS RATE TTg 

In order to design a good schedule we must be able to 
evaluate it. In this section we derive the exact analytical 
expression for the effective loss rate ir B for a given schedule S. 
We consider two cases. First, we derive ir B for an arbitrary 
schedule S. The resulting formula is simple but computa- 
tionally expensive and untractable for larger sizes n of the 
FEC block. Next, we derive tt b assuming that on each path 
separately the packets are evenly spaced. This constraint is 
compatible with the schedule we propose later and results in 
a computationally lighter formula for tt b . 



A. The effective loss rate tt b for an arbitrary schedule 

Let c be a n-tuple representing a particular failure config- 
uration; Cj, l<i<n, takes the value G (resp., B) if ith FEC 
packet is transmitted (resp., lost). By considering all possible 
failure configurations c we can compute the effective loss rate 
tt b for a given schedule S as follows: 

1 NT- 



11 b = j:2^d(c)-p(c), 



(3) 



where D(c) is the number of lost data packets (after the FEC 
recovery) for a given failure configuration c. For a systematic 
FEC(n, k) we have 

Die) = { ° k if 1{Cl=B} " n ~ k 

I E J= i 1 {c i =B} otherwise. 

In order to compute the probability P(c) of a failure con- 
figuration c, we consider the R paths separately, as follows. 
Denote by T^> the vector of length n r with departure times 
of packets scheduled by S on path P r . Similarly, let be 
an n r -element vector with the failure configuration on path P r 
defined by c. As the R paths are independent, the probability 
P(c) is 

R 

p( C )=n p ( c(r) )' w 

r=l 

where P(c^ r ^) is the probability of a failure configuration 
on path P r . The derivation of P(c^ r ^) for the Continuous Time 
Markov Chain loss model is straightforward. Indeed, denote 
by p\j (r) the probability of transition from state i to state j 
on path P r in time r, i.e., 



p^(r)=F[X r (r)=j\X r (0)=i}. 
From classical Markov Chain analysis we have: 
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where a = exp (— (/% +fJ> B )t). Now P(c^ r ^) can be easily 
computed. For example, for c < - r ' = GBB we have 

nc {r) = GBB) = 4 } (n) 



where Tj is the time interval between the ith and (i+l)th FEC 
packet sch 
generally, 



packet scheduled by S on path P r , i.e, = T-\_{ — . More 



P(c (r) ) 

Finally, we plug (O and (01 to (O, to obtain 
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Fig. 5. The time complexity of the effective loss rate -Kg under an arbitrary 
schedule jlll-At vs. the even-spaced schedule jlll-Bt : time_ratio is the runtime 
of Eq. divided by the runtime of Eq. (TTJ. Here we use FEC(n, 0.7n) on 
two identical paths. 



B. The effective loss rate tt b for even spacing on paths 

Equation © allows us to compute the effective loss rate 
tt* b for any schedule S. However, evaluating (0 is computa- 
tionally expensive because the main sum is over all 2™ failure 
configurations. Thus it can be applied to relatively small n 
only. Fortunately, we can significantly reduce the computation 
complexity by assuming that on each path P r separately the 
packets are evenly spaced, i.e., for all 1 < i < n r —l the intervals 
Ti+i—Ti are the same and equal to a constant that we denote 
by T r . Indeed, this constraint leads us to a formulation of ir B 
(below) that may take orders of magnitude less time to solve 
than (0, as shown in Fig. [5] 

In order to compute ir B under the even-spacing case, we 
look closer at the packets lost on every path separately. Denote 
by F r and D r the number of FEC and data packets lost on 
path P r , respectively (both before FEC recovery). Now we can 
rewrite the total number of lost FEC packets as F = ^2 r F r 
and the total number of lost data packets as D = ^ r D r . This 
decomposition leads us to the following derivation of ir B : 
1 " 

= r E nF=j)-nD\F=j} = 



j—n—k+1 
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According to Equation ((H), in order to evaluate ir B , for every 
path P r separately we need to calculate two components: (i) 
the probability P(F r =j r ) that j r FEC packets are lost, and (ii) 
the expected number K[D r \F r = j r ] of lost data packets given 
that j r FEC packets were lost. We achieve this by applying 
an approach similar to the one used in [15] in the context of 
a single path FEC, as follows. 

We consider a path P r and a set of all n r FEC packets sent 
on P r with equal packet interval T r . Denote by [?] the event 



that any b out of a consecutive packets are lost(J We allow for 



concatenation of events, e.g., G[£] (resp., [?]-B) means that 
any b out of a block of a consecutive packets are lost and that 
this block is preceded by a good packet (resp., followed by a 
bad packet). We can now compute P(F r = j r ) by conditioning 
on the state of the first packet that conforms the packet loss 
stationary distribution: 

P(F r - j r ) = P(G [^r 1 ]) + F(B [IZt]) = 



Jr) 



1 g ■nri i ]\G)+ n^-P([l^}\B), (9) 

where P([£] \q), q G {G, B}, is the probability that any b 
out of a consecutive packets are lost given that this block 
is preceded by a packet in state q. Although no general 
closed form of P([£] \q) is known, it can be calculated by 
the recursive approach first proposed in [17] and extended 
e.g. in [4,15]. We show in Appendix the details of this 

(r) (r) 

computation. It takes tt b , l/fX B and T r as parameters, and 
directly uses the relations (0 above. 

In order to find E[£V|.F r = j r ], we first derive P(D r — 
i, F r =j r ). Let us consider the k r data packets and the n r —k r 
redundancy packets separately, and additionally condition on 
the state of the last data packet as follows. 



(D T F T j r 
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r«P([Y] |G)P(Hdl \G) + nPnftf] |S)P(^1 \B). 
The first equality uses the Markov property of the loss model: 

¥(D r = i, F r = j r | last data packet is q) = 
= P(D r = i | last data packet is q) ■ P(-F r =j r | last data packet is q), 

where q S {G, B}. Now it is easy to calculate K[D r \F r = j r ], 
because 



i[D r \F r =j r ] = 



2^0 



i^D r — i. F r — j r ) 
HF r =j r ) 



(10) 



We plug (|9) and ( flO) into ((H) and obtain a complete formula 
for the effective loss rate ir* a : 
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-The form of [ ? ] is inspired by the similarity with the binomial coefficient. 



where every term of type P([£] |G) or P([°] \B) is calculated 
through the set of recursive equations given in Appendix. 

To the best of our knowledge, Equation ( fTTT i is the first exact 
solution of this model. Indeed, all previous works used some 
approximations of E[D r \F r = j r ). In [4] the authors approxi- 
mate E[D r \F r = j r ] by assuming that any configuration of j 
losses among the n FEC packets is equally likely; we call this 
approach 'Golubchik'. In [6,16] the authors use an intuitive 
linear formula, i.e., ¥,(D r \F r = j r ) = —jr- Although not 
mentioned in the papers this is only an approximation that is 
exact only when k r ,n r — > oo; we refer to it as 'Proportional'. 
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Fig. 6. Approximations of E[D r \F r 
given by jlOj . Here n r = 10, A; r = S 



: j r ] normalized by the correct value 
7r^ r) = 0.01 and = 2. 



We illustrate the differences between these approximations and 
the real values in Fig. [6] 

IV. The design of the schedule S 

In the previous section we derived an exact analytical 
formula for the effective loss rate ir B under a given schedule S. 
Here we focus on the design of a good schedule that results 
in small ir* B . 

Not all schedules are applicable in practice. Indeed, both 
(i) the maximal allowed FEC block transmission time t PE c 
and (ii) the packet interval T at the source impose important 
scheduling constraints. We say that a schedule is feasible if 
all the three following conditions are satisfied: 

CI T(i) > (i - 1) • T for l<i< k, i.e., no data packet is 
sent before it is generated at the source. 

C2 T(i) > (k— 1)-T for k < i < n, i.e., no redundancy packet 
is sent before all data packets have been generated (we need 
to collect all data packets in order to create the redundancy 
packets). 

C3 T(i) + tfiu-\ < tFEC for 1 <i < n, i.e., all FEC packets 
should arrive at the destination before the deadline. 

We assume that the path rates m, . . . , tir are fixed. There 
are usually a variety of feasible schedules. Below we discuss 
two classes of schedules we use in this paper. The first one, 
called Immediate, reflects the state of the art, whereas the 
second one, Spread, is our proposal. 

A. 'Immediate ' packet scheduling S lmm - state of the art 

We denote by Immediate the schedule S lmm = 
(j-imm mat represents the approach used in [4]- 

[7,9,10]. As the name suggests, Immediate sends the data 
packets as soon as they are generated, i.e., every time inter- 
val T. The redundancy packets use the same spacing T. So in 
general 



T imm (i) = (i-l)-T forl<i<n. 



(12) 



This specifies when the FEC packets are sent, but not on which 
path. A good and commonly used guideline for TZ lmm is to 
spread the packets on each path separately with (roughly) even 
spacing [10]. When the rates are equal, i.e., ni=7i2=- • -=tir, 
then this boils down to a simple round-robin schedule applied 



in [4,6,7,9]. In contrast, when the rates differ, a more elaborate 
approach should be used. For this purpose we adopt the credit- 
based technique proposed in [10], as follows. Each path is 
associated with a credit initially equal to 0. Before each FEC 
packet transmission the credit of every path P r is increased 
by n r /n. Next, the path with the largest credit is selected to 
transmit this packet; the credit of this path is decreased by 1 . 
This scheme is iterated until all n FEC packets are sent. 

The Immediate schedule can be interpreted as the following 
function: 

S imm = Immediate(ni ...n R , T) 

Two examples of Immediate schedules S imm are given in 
Fig. [3] (C) is a single-path schedule, i.e., with n\ = 6 and 
ri2 = 0, whereas in (D) we use two paths and n\ = «2 = 3. 

B. 'Spread' packet scheduling S spr - our proposal 

Under Immediate, all packets are sent as soon as they 
are generated, i.e., according to (fT2l . Instead, we propose to 
spread the packets evenly in all the available time on each 
path. We call this schedule Spread S spr = (T spr ,K spr ). 
Compared with Immediate, Spread additionally takes the path 
propagation times t\. . Ar and the maximal FEC block delay 
£pg£, as parameters, i.e., 



S spr — Spread(ni . . . tir, T, t\ 



, ,spr \ 
■ l Ri "TEC/- 



The design of Spread is not straightforward. Indeed, as 
the k data packets are generated at the source with spacing T, 
the paths are inter-dependent, which may easily lead to the 
violation of the constraint CI. For example, if we schedule 
packet 1 on P\ at time T(l) = (and k > 1), then no other 
packet on any path can be scheduled before time t = T. 

We guarantee the feasibility of Spread as follows. First, we 
order the paths according to their rates, starting from the path 
with the highest rate. (When two paths have the same rate, 
we take the one with a higher path propagation time first.) 
We consider the paths one by one following this order. For 
each path P r we spread the packets evenly on time interval 
[t' r i^fec - t r ], where takes the smallest possible value that 
satisfies the feasibility condition. (The value of usually 
grows with the number of paths processed.) We iterate this 
algorithm until all paths have been scheduled. 

We present two examples of Spread schedules S spr in 
Fig. |3] We use tp^ = 170ms and two different sets of rates: 
rii = 712= 3 in (E) and ni=4, U2 — 2 in (F). 

Spread is very effective. Indeed, we can prove that 

Theorem 1: The Spread schedule is optimal for the repeti- 
tion code FEC(n, 1). 

Proof of Theorem Q] Under FEC(n, 1) every data packet is 
replicated and sent in n copies; the reception of at least one 
such copy leads to a success. As there is only one data packet, 
all the redundancy packets (i.e., the duplicates of the data 
packet) can be generated already at time t = 0. This eliminates 
all the time dependencies between the paths. Therefore, every 
path P r separately must maximize the probability of at least 
one successful transmission. It is achieved by even spreading 



on the maximal allowed time interval [0, t^ c —t r ]. (The proof 
for the under repetition code on a single path can be found 
in [14].) This, in turn, is exactly what Spread returns under 
FEC(n,l). a 

Spread builds on even packet spreading - a simple and 
widely accepted guideline that is often thought of as leading 
to the optimal solution. Indeed, its optimality was proven for 
some particular cases [14]. But, surprisingly, this is not a 
general result. Consider for example FEC(4,3) on a single 
path (i.e., K = (1,1,1,1)) with loss rate n$ = 1% 
and average loss burst length l/(J, B — 5ms, and available 
time interval equal to 15ms. The even spreading schedule 
Si = ((0,5, 10,15), 11) yields tt* b = 0.53%. But the optimal 
schedule (found with optimization tools of Mathematica [18]) 
is Si = ((0,7.16, 12.51, 15), 11) and yields ir* B = 0.50%. 

This means that Spread does not guarantee optimality in the 
general FEC(n, k) case. However, we show later in simulations 
that it usually leads to almost-optimal solutions and is thus an 
effective and practical rule of thumb. 



opt 



C. Comparison of S lmm and S spr : Optimal schedules S, 
and S^l, and loss rate improvement 7. 

It was shown in previous studies that Immediate multipath 
is better than a single path communication. The main point 
we make here is that once we allow for multipath, the Spread 
schedule S spr that we propose in this paper is significantly 
better than the Immediate schedule S mm representing the 
state of the art. 

In order to demonstrate this, we compare the performance 
of S tmm and S spr in terms of their effective loss rates. What 
rates n\ . . . tir and what FEC block transmission time £fec 
should we use to make this comparison meaningful and fair? 
We should allow Immediate and Spread to optimize their rates 
ril . . . riR independently, given that they impose identical FEC 
block transmission times ipg 1 ™ = ippc- More precisely, we 
assume that the FEC parameters n and k are fixed, and we 
proceed in two steps. First, we optimize the rates ni . . . tir of 
Immediate, such that the effective loss rate tt* b is minimized. 
It results in the optimal Immediate schedule <S^ m . This, in 
turn, specifies t^f 1 as shown in (ffj. In the second step, we 
set tpg£, ~ ip™c" and optimize the rates ni . . . ur of Spread, 
resulting in the optimal Spread schedule 

Finally, we define the relative effective loss rate improve- 
ment 7 as the relative gain in n* B due to the usage of optimal 
Spread instead of optimal Immediate, i.e., 



' B \°opt ) 



n B\°opt ) 



(13) 



The metric 7 can be precisely evaluated by formulas (|7]i and 
(fTTT i. The values of 7 can be easily interpreted; for example, 
7 > 1 means that Spread performs better than Immediate. 

3 Note that 5*^J m a nd are optimal subject to their construction 

constraints presented in IIV-AI and UV-Bl respectively. 



D. Capacity constraints 

So far we have considered the case where every path P r can 
be assigned with any rate 0< n r < n. In practice, however, P r 
may have a relatively limited capacity, which would impose 
a direct constraint on n r . Fortunately, integrating these con- 
straints in our model is straightforward. Indeed, it is enough 
to respect them when computing the rates m . . . ur in S^ 

V. Performance evaluation 

In this section we evaluate our approach first in simulations 
and next on real-life traces. 

A. Simulation results 

The goal of simulations is twofold. First, we verify the 
correctness of our analytical results. Second, we can test our 
idea in a fully controlled environment and study the effect of 
various parameters on the results. 

1 ) Default values of parameters: If not stated otherwise, 
in our simulations we use the following default values. The 
data packets are generated at the source with interval T = 
5ms. Next they are encoded by systematic FEC(10, 8) and 
sent over R independent paths. For the sake of simplicity we 
speak mainly of systems with R = 2 paths: Pi and P2. It 
allows us to describe the path propagation time differences by 
a single parameter At — ti — ti that takes the default value 
At = 100ms. Finally, the paths Pi and P 2 have the same 
average failure rate tti = 0.01 and the average loss burst length 
equal to 1/fii = 10ms. 

2) The effective loss rate ir B as a function of At: In Fig. [7] 
we plot the effective loss rate tt b as a function of At for four 
different schedules. Our first observation is that the results 
obtained in a simulation of the model (circles) fit precisely 
the analytical curves (plain lines). 

Next, we compare the performance of various schedules. As 
the loss properties of the two paths are identical, the previous 
techniques described in [4,5,10] split the FEC packets equally 
between Pi and P^. This results in the optimal Immediate 
schedule <S*™ m = S l { ™^, i.e., with m = n 2 = 5. As this 
schedule uses multipath transmission, it is not surprising that 
^opt™ significantly outperforms the single path Immediate 
schedule 5™q™^ . Note also that, by construction, At does not 
affect the performance of any of them. 

In contrast, in Spread S^ r ^ we use the same rates as 
in iS*^ m , but we spread the packets uniformly within the 
time budget tpg^ 1 set by S^ 1 ^ (similar schedule is shown 
in Fig. |3ji). It results in a further decrease of the effective loss 
rate ir B . This difference moderately grows with At. However, 
for larger At the rates (5, 5) become suboptimal under Spread. 
For instance, in the inset in Fig. [7] we show the performance 
of Spread under various rate configurations (ni,n — ni); the 
minimum is reached for (7, 3). As descried in II V-CI allowing 
for this rate optimization leads to the optimal Spread schedule 
S^pl- Its advantage over S^ 1 ^ grows roughly exponentially 
with At. 
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The gain in FEC block transmission time tpgc by the usage of Spread 
instead of Immediate. Parameters: FEC(10,8), tt' 1 ' = tt' 2) = 0.01, = 
1 / [i B = 10ms, T = 5ms. For these parameters, the optimal Immediate rates 
are n\ =112 = 5, which results in the effective loss rate iTg(S^™^)= 0.24%. 
For Spread we choose the minimal FEC block transmission time tpg£ such 
that 7T? ^ ap M 



; (5 ^)<^(5 { ™). 



Fig. 7. The effective loss rate as a function of path propagation time 
difference At. We use FEC(10, 8) on two independent paths, Pi and P2, 
with data packet spacing T = 5 at the source. The losses on Pi and P2 
are modeled by continuous time Gilbert model with the same average failure 
rate ttb = 0.01 and the average burst length equal l//ij3 = 10ms. Four 
schedules are used: • - all packets are sent on a single path Pi 

with interval T, • - Immediate with optimal rates n\ = = 5, 

• 5?g g, - Spread with n\ = n,2 = 5, • - Spread with the rates 

ni,7i2 chosen optimally based on the value of At. Additionally, the 
dashed curve shows the effective loss rate of the optimal schedule, where 
packets are not restricted to even spacing on each path, as described in 
Section llVl The optimal schedule was found with numerical optimization tools 
of Mathematica [18]. Inset: tt* b as a function of rate n\ on path Pi for 
At = 50ms under Spread. In both figures the plain lines are the theoretical 
values according to formula jilt , whereas the circles are the results obtained 
in a simulation of the model. The size of confidence intervals (not shown) is 
comparable with the size of the circles. 

Finally, we observe that the performance of the optimal 
Spread schedule S s ^ t is very close to the global optimum 
(dashed curve) where packets are not necessarily evenly- 
spaced, as described in Section |IV] This confirms the use- 
fulness of the even-spread guideline that we follow in Spread. 

3) Loss rate improvement 7 as a function of various pa- 
rameters: Clearly, there are many parameters that affect the 
performance of the schedules. We study the effect of some of 
them on the relative loss rate improvement 7 in Fig. [8] 

First, plot (A) confirms that the advantage of Spread 
over Immediate grows with the path propagation time differ- 
ence At. 

Second, with growing packet interval T at the source, the 
fixed At becomes a smaller fraction of the entire FEC block 
transmission time £fec- As a consequence, there is relatively 
less to exploit and 7 drops with T, see plot (B). A similar 
phenomenon can be observed in plot (C), where ipEC grows 
due to an increase of the number n of FEC packets. 

Finally, in Fig. |8j3 we vary the loss rate tt b ^ of path P%. 
The difference between path loss rates is a crucial parameter 
affecting the performance gain of Immediate multipath over 
the single path transmission. Indeed, if out of two paths 
one is very lossy and the other one is very good, then the 



optimal Immediate multipath schedule S 1 ™™ uses mainly (or 
only) the better path, which substantially limits the gain of 
multipath [5,7]. This is illustrated in plot (D) by the dashed 
curve; the ratio ir* B (Si™% ) /ir* B (<S*™ t m ) is the largest when the 
paths have identical loss properties, and quickly diminishes 
with growing difference between and tt b \ 

We could expect a similar diminishing effect for the advan- 
tage 7 = 7r^(5™ t m )/7r^(5^[) of Spread over Immediate. 

Surprisingly, this is not the case; 7 remains relatively stable 

(2) (2) 

(3 < 7 < 6) for a wide range of values of Tt B . For Tt B pa 0.25 
the path P2 becomes too lossy, and both Immediate and Spread 
send all packets on Pi only and thus become equivalent. 

4) Minimizing t?EC - decreasing delays and fighting jitter: 
So far we used Spread to minimize the effective loss rate n* B 
while keeping the FEC block transmission time tp^ c not larger 
than that of Immediate schedule tp™™. Let us now reverse 
the problem: Let us minimize the FEC block transmission 
time ipgp of Spread, while keeping its effective loss rate not 
larger than that of Immediate, i.e., subject to TT B (S^pl) < 

^B^opt )■ 

We plot the results in Fig. [9] The gain t^™ — ip^, in 
FEC block transmission time is significant and grows roughly 
linearly with At, as ^fec™ - ^fec — At/2. The reduction of 7;fec 
brings obvious advantages to delay-constrained applications 
using the multipath FEC system. First, the effective end-to- 
end delays get smaller which allows us to reduce the playout 
time at the destination, keeping the same level of the effective 
loss rate. 

Another important interpretation is related to the delay 
jitter, i.e., variations of path propagation times. Indeed, in 
this work, as in most previous works on multipath FEC, we 
consider the path propagation times constant and focus on 
(correlated) packet losses only. However, as Spread results in 
a smaller delay £feo it also leaves more space to accommodate 
potential jitter, naturally making Spread more robust to jitter 
than Immediate. 




Fig. 8. Relative loss rate improvement 7 due to usage of Spread instead of Immediate as a function of four parameters: (A) path propagation time difference 
At, (B) packet generation interval T at the source, (C) the size n of a FEC block, (D) loss rate n B of path P2. We consider a system with R = 2 paths 

(r) (r) 

and the following default parameters: FEC(10,8), At = 100ms, T = 5ms, n B = 1%, l/fJ. B = 10ms, k = n — 2. All results shown here are analytical. 
The irregular shapes of the curves in this and other figures are expected, because the computation of 7 involves the rates optimization (see llV-Cl . For instance, 
in figure (D), going from left to right, the optimal Immediate and Spread rates (m, 712) change gradually (and separately) from (5, 5) to (10, 0); every such 
rate transition may introduce irregularities in the shape of the curves. 



5) Other FEC parameters n, k: So far we assumed that 
Immediate and Spread use the same general FEC parameters 
n and k; only the rates on particular paths could be optimized. 
However, in some cases the optimal choice of n and k under 
Spread may differ from that of Immediate, given the same 
redundancy k/n. 

For example, according to our additional simulations (not 
shown here), for the setting in Fig. [7] and At > 220ms, the 
Spread schedule using FEC(15,12) would outperform Spread 
with FEC(10,8). Similarly, FEC(12,6) would be better for 
Spread than FEC(10,5) for At > 140ms. Note, however, that 
this phenomenon can be observed only for relatively large 
values of At that rarely occur in reality. 

To conclude, the loss rate improvement 7 can be manyfold, 
but its exact value strongly depends on various parameters. 
First, the advantage of Spread over Immediate grows with 
path propagation time differences At, but drops with the data 
packet generation interval T and FEC block size n. Second, the 
optimal Immediate rates are not always optimal under Spread; 
usually optimal Spread sends more packets on faster paths. 
Third, although the advantage of Immediate over a single 
path transmission quickly diminishes with growing differences 
between the loss rates of the paths, the advantage of Spread 
over Immediate is relatively stable. Finally, Spread can also 
achieve FEC block transmission times much smaller than 
Immediate, still guaranteing the same or better performance in 
terms of the effective loss rate. This results not only in smaller 
effective delays, but also in a higher robustness to delay jitter. 

B. Trace-driven PlanetLab evaluation 

In the previous section we presented analytical and sim- 
ulation results where the packet losses were modeled by 
the Continuous Time Gilbert Model. As any model, it is 
only an approximation of reality. In this section we feed 
our simulations with real-life packet loss traces collected in 
Internet experiments. 



1 ) Data sets: The traces come from two different PlanetLab 
(PL) [13] experiments. On every path the packets are sent with 
time-interval T, i.e., with the generation rate at the source. 
Every trace is a sequence composed of symbols G (packet 
not lost) and B (packet lost). 

Every time-constrained experiment on PlanetLab should be 
designed and interpreted carefully. This is because at any 
point in time most of PlanetLab nodes are overloaded. Not 
only their CPU utilization is at 100%, but more importantly 
the queueing delays experienced by the running processes 
can be very significant - even up to several seconds between 
two consecutive accesses to CPU. This results in incorrect 
propagation time measurements and packet dropping due to 
incoming buffer overflow at the destination [19,20]. Moreover, 
the situation changes dynamically. We minimize the effects 
of these problems by introducing periodic pauses in packet 
generation and avoiding the highly loaded PlanetLab nodes. 

We use the following two data sets. 

a) 'Relays' - PlanetLab with relays: In this experiment 
every trace is collected on a two-hop overlay path between 
three PlanetLab nodes: source, relay and destination. The 
UDP packets at the source are generated every T = 5ms 
and sent immediately to the relay that forwards them to the 
destination. After every one-second-long packet generation 
period we introduce one second of idle time in order to avoid 
dropping packets at PlanetLab hosts when the probing traffic is 
too bursty. We collected more than 5 '000 traces, each covering 
100 seconds of packet generation time. 

In order to further reduce the effect of overloaded PL nodes 
on the results, for every experiment separately we select the 
source, relay and destination randomly from 50 currently least 
loaded PL nodes. As the load estimate we use the number 
of processes queueing for the CPU and I/O devices; it can 
be obtained by parsing the file /proc/stat that stores the 
information about kernel activity. 

b) 'Web sites' - PlanetLab to popular web sites: This 
data set consists of 2'839 traces used in [10]. They were col- 



lected by sending 16-byte ICMP echo packets from 57 Plan- 
etLab hosts to 55 popular web sites selected from [21]. Next, 
the ICMP echo-reply packets were captured by Tcpdump, 
resulting in traces where packets travel from a PlanetLab node 
to a web site and back to the original PlanetLab node. The 
packets were sent every T = 2ms. As above, every one-second 
packet generation time was followed by one-second idle time. 
Each measurement lasted at least 800 seconds. As in [10], we 
split it into 40-second long intervals that we call chunks. 

Despite the measures we took, in both data sets we find 
traces with numerous long (100ms and more) blocks of 
consecutive losses. As this is not caused by network losses, but 
rather by buffer overflow at the nodes due to CPU queueing, 
we exclude these traces from our simulations. 

2) Simulation setting: In a simulation of a i?-path scenario 
we use R traces (one per path) randomly chosen from the pool 
of all available traces. Thus, by construction, the R traces are 
independent (typically generated at different times and places 
in the Internet). For the sake of simplicity, we restrict the 
presentation to the case R = 2. 

Our basic metric is loss rate improvement 7. As described 
in Section IIV-CI it optimizes the rates of Immediate and 
Spread. This optimization is based on the observed traces. 
One approach to do this is to infer for every path its loss rate 

(r) (r) 

tt b and the average loss burst length l/fi B , feed them into 
the model and optimize the rates as in section [VAl However, 
this technique has two drawbacks: it introduces errors when 
measuring the path properties, and assumes a particular packet 
loss model. We avoid these problems by working directly on 
the traces - the optimal rates in Sl™ t m and are those that 
perform best on a given chunk. 

We present two types of results. In Oracle we choose the 
optimal rates for the currently evaluated chunk. In contrast, in 
Prediction we use the optimal rates of the preceding chunk 
to evaluate the current chunk. Thus Oracle shows the best 
achievable results for Immediate and Spread with no prediction 
errors, whereas Prediction is a practical implementation. 

3) Results: In Fig.fTOlwe present the results for FEC(10. 8). 
The figure presents the cumulative distribution of the relative 
loss rate improvement 7 for At = 10ms and At = 50ms. We 
consider the cases where optimal Immediate uses both paths 
(i.e., niy^Q and m 7^ 10) and there is a space for improvement 
(i.e., Tr B (Sl™ t m ) > 0). In about 90% of cases we observe an 
advantage of Spread over Immediate. For instance, for both 
data sets under Oracle with At = 50ms, in 50% of cases the 
loss rate drops by a factor of 3 or more when we use Spread 
instead of Immediate. For smaller At the advantage is less 
pronounced, which is in agreement with the results presented 
in the previous section. 

Surprisingly, in roughly 10% of cases Spread performs 
slightly worse than Immediate. A possible explanation is that 
in some traces we can find loss patterns that are periodic, 
presumably due to other applications running on PlanetLab 
nodes. If such an unnatural loss pattern gets aligned with the 
packets scheduled by Spread on one or more paths, then the 
performance of Spread may drop below Immediate. 



'Relays' 

- 


s. \"»'~ - - : 




Oracle 

^— ^— Prediction 


) 1 


2 3 4 = 
Loss rate improvement 7 


'Web sites 




^^P^ 18 ; 




Oracle 

^— ^— Prediction 



1 1 — 1 ' 1 1 1 

1 2 3 4 5 

Loss rate improvement 7 



Fig. 10. The effective loss rate improvement 7 (by using Spread instead 
of Immediate) in trace-driven simulations under FEC(10, 8). We use R = 2 
independent paths with real-life loss traces; their propagation times differ by 
At. We consider two data sets: 'Relays' (top) and 'Web sites' (bottom). 

Finally, we find our simple prediction method satisfactory, 
as the Prediction curve is always close to Oracle. 

VI. Related work 

The performance of FEC on a single path with correlated 
loss failures was studied e.g., in [1,14,15]. One common 
conclusion is that the FEC efficiency drops with growing 
burstiness of packet losses. 

Multipath transmission as a way of de-correlating the packet 
losses and increasing the performance of FEC was first pro- 
posed in [3]. It got more attention recently, e.g., in [4]- 
[7,9,10]. Multipath was also studied in the context of Multiple 
Description Coding [11]. 

In [5] the authors study a multipath FEC system by simu- 
lations only, on artificially generated graphs. They also give a 
heuristic to select from a number of candidate paths a set of 
highly disjoint paths with relatively small propagation delays. 

There are a number of approaches to evaluate analytically 
the performance of multipath FEC with independent paths and 
bursty path losses. For instance, [4,6,7] and [10] give four 
different derivations of the effective loss rate ir* B (or related 
metrics) in such a setting. However, in all four cases the 
resulting formula is only an approximation of the complete 
solution due to (sometimes very significant) model simpli- 
fications. First, [6] [7] use a discrete Gilbert model. Thus 
two consecutive packets on one path are equally correlated 
irrespectively of the time intervals between them, which makes 
the models inherently unable to capture any aspects of varying 
packet spacing. [10] also uses the a discrete Gilbert model, but 
adapts the transition matrix appropriately. Second approxima- 
tion comes when computing the number of lost data packets 
given that a FEC block cannot be entirely recovered: [4] and 



[6] use approximations described at the end of section IIII-BI 
[7] simplifies the model by assuming that in such a case all 
data packets are lost, and [10] assumes that the numbers of lost 
data packets and redundancy packets are not correlated. Third, 
[6] considers only a scenario with identical loss statistics on 
every path. Finally, [10] assumes a large number of active 
paths R 3> 1 and small individual path rates n r <C n. This 
allows the authors to apply the central limit theorem and 
approximate the joint distribution of the number of lost data 
and redundancy packets by a bivariate normal distribution. 

To the best of our knowledge, we are the first ones to give 
an exact analytical formula for the effective loss rate n* B of 
FEC protection scheme on multiple independent paths with 
path losses modeled by the Gilbert model. 

As in most other approaches, we assume that the back- 
ground cross-traffic is much larger than our own, and thus the 
load we impose on a path does not affect its loss statistics. 
Scenarios where this assumption does not hold were studied 
in [16] in the context of a single path FEC, and in [22] for 
multipath FEC. 

As in [4,9,10,22] we assume the paths to be independent. 
This can be achieved by detecting correlated paths in end- 
to-end measurements [23] and treating them as one. Another 
approach is to find paths that are IP link disjoint, which should 
be possible if the site is multi-homed. Finally, even if all the 
available paths are to some extent correlated we can still get 
some performance benefits [5,6,8,24], though limited [25,26]. 

Finally and most importantly, to the best of our knowledge 
no attempt has been made to exploit the path propagation 
time differences in multipath FEC. Indeed, all the works listed 
above use some variant of the Immediate schedule, where 
packets are sent as soon as they arrive at the source. In contrast, 
in this paper we have proposed the Spread schedule that 
exploits these propagation time differences and significantly 
improves the performance. 

VII. Conclusion 

In this paper we started from the observation that the prop- 
agation times on multiple paths between a pair of nodes may 
significantly differ. We proposed to exploit these differences 
in the context of delay-constrained multipath systems using 
FEC, by applying the Spread schedule. We have evaluated 
our solution by a precise analytical approach, and simulations 
based on both the model and real-life Internet traces. Our 
studies show that Spread substantially outperforms previous 
solutions. It achieves a several-fold improvement (reduction) 
of the effective loss rate. Or conversely, keeping the same level 
of effective loss rate Spread significantly decreases the FEC 
block transmission time, which limits the observed delays and 
helps fighting the delay jitter. 
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IX. Appendix 



n 



A. Recursive equations 

Here we derive the probability P([?] \q) that any b out of a 
consecutive packets sent on a path P r (with packet interval T r ) 
are lost given that this block is preceded by a packet in state 
q € {G, B}. Although no general closed form of P([?] \q) 
is known, it can be calculated by the recursive approach first 
proposed in [17] and extended e.g. in [4,15]. Indeed, 

P([»] \B) = R(b + l,a + l) 
P([J] \G) = S (b + l,b-a + l), 

where functions R(m, n) and S(m, n) can be calculated as 
follows [15]: 



. P(n) for to =1 and n>l 

' " "' ' " ' = \ EZT +1 p(i)R(m-l, n-i) for 2 < to < n 

for m = 1 and n > 1 



S(m, n) 



Q(n) 

E^7 +1 q(i)S(m-l, n-i) for 2 < m < 



where 



p(i) 
Pii) 

Qii) 



1-q if i = 1 

q(l — p) l ~ 2 p otherwise 

1 if i = 1 

q(l — p) l ~ 2 otherwise 

1 — p if i = 1 

p(l — qY~ 2 q otherwise 

1 if i = 1 

p(l — q) l ~ 2 otherwise 

P = Pg]b( T t) -given by© 

1 = Pb c( T r) - given by © 



B. The effective loss rate -k* b for non-systematic multipath 
FEC 

All formulas shown so far were derived for the systematic 
version of FEC. The non-systematic FEC(n, k) is easier to 
handle, and leads to a simplification of these formulas, as 
follows. 

For an arbitrary schedule the derivation of (0 is the same, 
except that now the number D{c) of lost data packets for a 
given failure configuration c is 



D{c) 



if E"=i 1 {c 4 =B} <n-k 
k otherwise 



Consider now the equal spacing on paths. As the number 
of lost data packets is always k for at least n — k+1 lost FEC 
packets, the formula (fTTT l for the effective loss rate ir^ gets 



simplified to 

= \ E fc-P(F = i) = 

j=n— 
n 

= E E P(^i=ii.---,^=ifl) = 



j= n— fc+1 
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where P([£] \G) or P([£] \B) are calculated in lDTAl 
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